biotechMIogy 

SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: . ■ :: ' 

Application Serial Number: 

Source: Ar/^f 

Date Processed by STIC : 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

Pt F A<5F FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

S^ttTOWGAC^OT ™ PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 

APPLICANT. WITH A NOTICE TO COMPLY or, 
2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 

NOTTfF TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX 703-308-4216. 
PATENTIN 2.1 e-mail help: pntin2lhe.ln@usDto.gov or phone 703-306-41 19 (R ; Wax) 
PATENTIN 3.0 e-mail help: patin3heln@uSDto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
wwsiflN 3.0 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW; 

Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPG) Standard ST.25. • 
Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant Checker allows public users to check sequence listings in Computer Readable^torm 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errorcd sequence 
listings, thus saving time and money. 

Choker Versio" T n r.« he rinwn loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 



^Wrapped Nucleics The number/text at the end of each line '"wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

^Non-ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text 

_ Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n orXaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 



_?atentln 2.0 
"bug" 



Skipped Sequences 
(OLD RULES) 



A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) - Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. , 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ED NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



_Skipped Sequences Sequencers) 

(NEW RULES) <2I0> sequence id number 
<400> sequence id number 
000 



10 



11 



12 



_Use of n's or Xaa*s 
(NEW RULES) 



JLn valid <213> 
Response 



Use of <220> 



Patentln 2.0 
"bug" 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence 

Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/8 5 6 , 74 9 

Input Set : A:\Dc-120.app 
_X^it^ua±_SetjN : \CRF3\06182001\I8 5674 9 . raw 



DATE: 06/18/2001 
TIME: 16:20:37 
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Homo sapiens 
1 



Homo sapiens 
2 



<140> CURRENT APPLICATION NUMBER: 
<141> CURRENT FILING DATE: 2001-0 
<151> PRIOR FILING DATE: 1998-11- 
<110> APPLICANT: 
<120> TITLE INVENTION 

HQJt— FI L E REFE 
<160> NUMBER OF SEQ ID NOS : 10 
<170> SOFTWARE: Patentln Ver . 2.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 5 
<212> TYPE: DNA 
<213> ORGANISM: 
<400> SEQUENCE: 
aagat 

<210> SEQ ID NO 
<211> LENGTH: 6 
<212> TYPE: DNA 
<213> ORGANISM: 
<400> SEQUENCE: 
aaggat 

<210> SEQ ID NO: 3 

<211> LENGTH: 4438 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<400> SEQUENCE: 3 

cctcacatat ttcaaatcca tctcaaattc 
cggttttgac agggctgaac tgagctatgg 
ttggtttgaa tttccgggaa aaggagctat 
aactataagc attttccaca gtgtttaata 
aacaaattaa taaaatgctc aaaataatct 
atcaccaaaa ataaatgtgc tatgcttcat 
gctcacacct ataatcccaa cactctggga 
gagtttgaga ctagcctggc caacatagtg 
tcagtgggac atggtggtgc acatctgtag 
gattgcttaa gcccaagagt ttgaggtccc 
cttgtctcta aataaataaa ttagttaatt 
cttcccatgg taattttaaa aactttagtc 
ttcttataaa caactcagcc taatgagaaa 
tctatttctt cattgatcta ttcatttatt 
actatgttga ggtagtataa actataaatt 
ttgagtgaca gcttgaaggg gaggattctt 
agacattatt gtggacttga gggagttaat 
tctctgttag ggaagcaaga tttctatccc 
agtggcacag aactgtgttc aacgagtgac 
ggttaattga tcaatcattt ctatccagaa 
gtaccaagtg tctgttaagt gtctggtcaa 
atgtaggaag aatacacacc gtgagcaaat 



US/09/856/ 
5-24 
3 0 



Does Not Compiy 

Corrected Diskette Needed 



acattcacag 
tatgagtagc 
agctgcaaaa 
aaccatgcag 
gatactaaat 
ataaaatctc 
ggccgaggtg 
agacctcatc 
ttctagctac 
tacactccag 
gaatgtccag 
ttaggagagt 
tagaccctgt 
aactcctgta 
caacaagttt 
tcaggcctgt 
gtgacagtcc 
cagagtatgt 
taccgctctg 
ggtaaccatg 
tggttatcca 
ttttcccacg 



atgtaagagc 
actcatcccc 
atctgtttca 
ataagaaaat 
gcttgtagca 
cagtaaggct 
agagaactgc 
tctacaaaaa 
ttgggagtct 
cctaagcgac 
tcagttgata 
aaaagtcatg 
atttaagtgt 
acaatcattt 
gataagggaa 
gggaccgggt 
tcgtgtctcc 
atgtgttatg 
ctgtgtgccc 
aggactgacg 
taaagctact 
tgtaactctc 



tgggaaagga 
agaaagtctc 
caaatgtgct 
attattgaca 
tggcatgcaa 
gggtgtggtg 
ttgaggccag 
atcttaaaaa 
gaggcaggaa 
agagggagac 
tatccaaatt 
gacataagac 
catttaagta 
gcagacacct 
ataagagaga 
ggtggcatgg 
agacactttc 
tctggactgc 
tgggacttgg 
gaaccagtgt 
gcatggccat 
acaacaaaat 



60 

120 

180 

240 

300 

36 0 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



file://C:\CRF3\Outhold\VsrI856749.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/856,74 9 
Input Set : A:\Dc-120.app 

Output Set: N:\CRF3\06182001\I8 56749.raw 



DATE: 06/18/2001 
TIME: 16:20:37 



52 agcattaaat acttaatgtt tctggctaaa gaccatttca agacttgcag gacaaaaaaa 1380 

53 tagaaaaaat atctgacact caaatggagt tacaaaatta aaacggctga attccccagc 1440 

54 ataaaaaaat atgaagcaag attgaaattt caagactaag tttaatatgg aaaaatacaa 1500 

55 atatgtttga ggcctttcac agagcagcca gcatgaagca accaagaaaa ccacggaaat 1560 

56 aatctggctg cctggaaata gtccggagtc agctgacaca gccacacgag agccctctta 1620 

57 tgcttgtcat aaggggtaaa ggaataattt cagaaaatta catttaaaag agaattatgg 1680 

58 gggaagaaga tgctcccaga ggaaacaaat agtatggatg tgaagagcaa atacaacttt 1740 

59 aacatgtttt gaacttcttg gaaactatgc taagtttagg cattgctagg atttggtatg 1800 

60 atttaatccc cagctttctg ttctaaattt ttgttttctt ttttactctc aaataaatca 18 6 0 

61 tatgctagca ccagctgcaa agttacatat gttgtattag acgatcttcc atgaatacct 1920 

62 aactggaaat tccaagattc agggccatgt gaatctaggc tggctgctta accaaaactt 1980 
6 3 aatttaattt ttttcgttta ttttaggaaa aaaaattaac gaaaagatgt ttcaagcaac 204 0 

64 cagtttccaa tccacgtcag caactatgac atttaatgaa acactgtgag catttagcat 2100 

65 gagagctctg gactcagatg cagggagctt tgctagagaa gggaggaaaa agcaggcatg 2160 

6 6 atgtggcggg ttgtggggga ctccaaggct ctatttccaa cttccatcag agaacttctg 222 0 

67 ttttcacctg gttttcaaat ttgctttcca aaagggattt tgtttaagta aaggatacag 2280 

68 aggtttataa aagtttgaaa acttctacat tgcaggatgt gcaggctctt gccagatggg 2 340 

69 acagtgtatg agactcttcc agggtgacgt cttaggcaat ttcctgtcca atcacagatg 2400 

70 gtcacatgct gctttcctga gttaacctat taactcaccc ttgtttccca ggcctcagtg 2460 

71 gagctaggct tgtcacgtct tcacagtgac tagattccct cacagtcgag tatatctgcc 2520 

72 actccttgac ttttaaaaca tagtctatgt tcaccctcta atatgaagag cccctttcac 2580 

73 tattttcttt gtctgtgctg gagtcacttc agtggcaagt gttctttggt ctctgccgca 2 64 0 

74 ccctccctct gatgcctctg agaagaggat ttccttttcg tgagaatgtc ttcccattct 2700 

75 tcttaccctc ttgaactcac atgttatgcc acttagatga ggaaattgta gttaaataat 2760 

76 tagaaagata tgacttatct caaatcaatc caagatatac tgaagtattg tttatgagta 2820 

77 agatatcagt cttgacgcag aaagaaaaca ggaatccata aggggaggaa agtgttgaaa 2 880 

78 agcaaacctg atacagtggg aaaggtggga gacaccataa ggtgctgaag tgataaaaca 2 940 

7 9 ggccagtgtt tctccactgt atgttttcaa taaatgcttc caaggaagga gagtggggca 3 000 

80 tgagtagggg agctacagag ataaaccaac ttttcttacc aggaatgcta cagatagcac 3060 

81 tggtgacacc ggtcaccagt acccaagaca atttaatgtg gaacataagt acaggaatac 3120 

82 acatctttca ttacagagcc atgtatttat tttaatgggc aggagatgct aaataagatc 3180 

83 ttttgaatgg aggaatgcat aaatatatga atgaatgcat acatgaaaga ataaataaat 3240 

84 gctgcctagc accaaggagc gaagatagac tcatatcaag ggaaacaagt atgattaaaa 3 300 

85 ataagacccc agagtcacgc tcagtctctt tccagccttt tcatcatccg gtacattcag 3 360 

86 acaagtttca gggaaggatc ctatttgtcc catgataatg atgggcaagg ggtggggagt 3420 

87 tatctcatac tccgcctgtg gatgaggggt cttctcaggt aaggctctta aatcctaggc 3480 

88 ctgagtaaat tttttcaaat tttattttag acagggtccc tctctgttgc ctaggctgga 3 540 

89 gtgcagcggc acaatcacag ctcaatgcag cctcaacctc ccaggcccaa gtgatcctcc 3600 

90 cacctcagcc tcttcagtga ctaggactac aggtgcatga ctccatgctt ggctaacttt 3660 

91 aaaaaatgtt tgtttgtttg tttgtttttt acagagatgg ggtctcacca tgttgcccag 3 72 0 

92 gctgatcttg aactcctggg ctcaagtgat tcccctgcct cggcctcctg aaattctggg 3780 

93 attataggct tgagccacca tgcctggctc tgagtaaaga ttaagggaag ccatggtgct 3 840 

94 atcgcaatag ggtaccaggc agcttaacaa aggcagaagg gaacctcaga gaaccccgaa 3900 

95 gagccaccgt aaagtgagtg ctgggggagc tgaacttcag tcagtacagg agccgaacag 3960 

96 ccatcaggtg cgcagtgtta gtaattccac cctctgccct gggagcaagg tgtgtggaga 4020 

97 aacctgtagc actttatgac catcagaacc agcctttttc aaaaagacca tggagtactc 4080 

98 tttgacctgt gtatataaca agaacctttc tcaaatagga aagaaatgaa ttggagaaaa 4140 

99 ccactgttta catggcagag tgtgtctcct tcgcacacat cttgtttgaa gttaatcatg 4200 
10 0 acattgcaac accaagtgat tccaaataat ctgctaggag tcaccatttc taatgattgc 42 60 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/0 9/8 56 , 74 9 



DATE: 06/18/2001 
TIME: 16:20:37 



Input Set : A:\Dc-120.app 

Output Set: N:\CRF3\0 6182001\I856749.raw 



101 ctagtctatt catagctaat caagaggatg ttataaagca tgagtcagac acctctggct 4320 

102 ttctggaagg gcaaggactc tatatataca gagggagctt cctagctggg atattggagc 4380 

103 agcaagaggc tgggaagcca tcacttacct tgcactgaga aagaagacaa aggccagt 4438 

105 <210> SEQ ID NO: 4 

106 <211> LENGTH: 31 

107 <212> TYPE: DNA 

108 <213> ORGANISM: Artificial Sequence 

110 <220> FEATURE: 

111 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

113 <400> SEQUENCE: 4 

114 gtggaagctt acacctataa tcccaacact c 31 

116 <210> SEQ ID NO: 5 

117 <211> LENGTH: 33 

118 <212> TYPE: DNA 

119 <213> ORGANISM: Artificial Sequence 

121 <2 2 0> FEATURE: 

122 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

124 <400> SEQUENCE: 5 

125 ctgcctggta ccctattgcg atagcaccat ggc 33 

127 <210> SEQ ID NO: 6 

128 <211> LENGTH: 35 

129 <212> TYPE: DNA 

130 <213> ORGANISM: Artificial Sequence 

13 2 <220> FEATURE: 

133 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

135 <400> SEQUENCE: 6 

136 aaataattag aaagatatga cttatctcaa atcaa 35 

138 <210> SEQ ID NO: 7 

139 <211> LENGTH: 36 

140 <212> TYPE: DNA 

141 <213> ORGANISM: Artificial Sequence 

14 3 <2 2 0> FEATURE: 

144 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

146 <400> SEQUENCE: 7 

147 aaataattag aaaggatatg acttatctca aatcaa 36 

149 <210> SEQ ID NO: 8 

150 <211> LENGTH: 51 

151 <212> TYPE: DNA 

152 <213> ORGANISM: Artificial Sequence 

154 <220> FEATURE: 

155 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

157 <400> SEQUENCE: 8 

158 ttcattgtta atcaagagga tgttataaag catgagtcac accctcagct t 51 

160 <210> SEQ ID NO: 9 

161 <211> LENGTH: 20 

162 <212> TYPE: DNA 

163 <213> ORGANISM: Artificial Sequence 

165 <220> FEATURE: 

166 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/8 56,74 9 



DATE: 06/18/2001 
TIME: 16:20:37 



Input Set : A:\Dc-120.app 

Output Set: N:\CRF3\06182001\l85674 9.raw 



168 <400> SEQUENCE: 9 

169 gttatgccac ttagatgagg 20 

171 <210> SEQ ID NO: 10 

172 <211> LENGTH: 20 

173 <212> TYPE: DNA 

174 <213> ORGANISM: Artificial Sequence 

176 <220> FEATURE: 

177 <223> OTHER INFORMATION: Description of Artificial Sequence : Synthetic 

179 <400> SEQUENCE: 10 

180 ttcctcccct tatggattcc .20 
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|g ^ '-x^ atmw r 

<151> 1998-11-30 

□ 

□ 

<160> 10 
□ 

□ 

<170> Patentln Ver. 2.0 



(/UU. fatte n ^7£2) 



• 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/0 9/8 5 6 , 74 9 



DATE: 06/18/2001 
TIME: 16:20:39 



L:2 
L:2 



270 C 

271 C 



L : 2 M: 289 W 

L: 0 M: 201 W 

L: 0 M: 201 W 

L : 0 M: 201 W 



Input Set : A:\Dc-120.app 

Output Set: N:\CRF3\06182001\l856749.raw 

Current Application Number differs, Replaced Current Application No 
Current Filing Date differs/ Replaced Current Filing Date 
Identifier Missing or Out-Of -Order , <150> PRIOR APP NO 
Mandatory field data missing , APPLICANT NAME 
Mandatory field data missing, TITLE INVENTION 
Mandatory field data missing/ FILE REFERENCE 
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